Extensive feature detection of N-terminal protein sorting signals
نویسندگان
چکیده
MOTIVATION The prediction of localization sites of various proteins is an important and challenging problem in the field of molecular biology. TargetP, by Emanuelsson et al. (J. Mol. Biol., 300, 1005-1016, 2000) is a neural network based system which is currently the best predictor in the literature for N-terminal sorting signals. One drawback of neural networks, however, is that it is generally difficult to understand and interpret how and why they make such predictions. In this paper, we aim to generate simple and interpretable rules as predictors, and still achieve a practical prediction accuracy. We adopt an approach which consists of an extensive search for simple rules and various attributes which is partially guided by human intuition. RESULTS We have succeeded in finding rules whose prediction accuracies come close to that of TargetP, while still retaining a very simple and interpretable form. We also discuss and interpret the discovered rules.
منابع مشابه
Human housekeeping genes are compact.
Genome Biol. 2, 1018 13 Gray, M.W. et al. (1999) Mitochondrial evolution. Science 283, 1476–1481 14 Lang, B.F. et al. (1999) A comparative genomics approach to the evolution of eukaryotes and their mitochondria. J. Eukaryot. Microbiol. 46, 320–326 15 Henze, K. and Martin, W. (2001) How do mitochondrial genes get into the nucleus? Trends Genet. 17, 383–387 16 Chinnery, P.F. (2003) Searching for ...
متن کاملBioinformatics Analysis of Physichemical Properties of Protein Sorting Signals
Subcellular localization of proteins is usually guided by their sorting signals encoded by subsequences of amino acids at the N-terminal or C-terminal ends. These signals are usually composed of a set of physichemically conserved amino acid groups such as the hydrophoblic cores of secretory signal peptides. Using experimentally determined sorting signals, biologists have identified the physiche...
متن کاملDevelopment of a recombinant protein-based dot-blot hybridization assay for the detection of antibody to chicken infectious bronchitis virus
Nucleocapsid (N) protein of infectious bronchitis virus (IBV), one of the viral structural proteins, inducesstrong antibody response in natural infection. In this study, a simple, recombinant N protein-based dot-blottest was developed to serologically examine chicken serum samples for the presence of IBV antibody.Initially, 72 serum samples were tested for the presence of IBV antibody using a c...
متن کاملIntra-ER sorting of the peroxisomal membrane protein Pex3 relies on its luminal domain
Pex3 is an evolutionarily conserved type III peroxisomal membrane protein required for peroxisome formation. It is inserted into the ER membrane and sorted via an ER subdomain (the peroxisomal ER, or pER) to peroxisomes. By constructing chimeras between Pex3 and the type III ER membrane protein Sec66, we have been able to separate the signals that mediate insertion of Pex3 into the ER from thos...
متن کاملPrediction of N-terminal protein sorting signals.
Recently, neural networks have been applied to a widening range of problems in molecular biology. An area particularly suited to neural-network methods is the identification of protein sorting signals and the prediction of their cleavage sites, as these functional units are encoded by local, linear sequences of amino acids rather than global 3D structures.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 18 2 شماره
صفحات -
تاریخ انتشار 2002